Inferring Approximate Functional Dependencies from Example Data
نویسندگان
چکیده
This paper proposes a kind of PAC (Probably Approximately Correct) learning framework for inferring a set of functional dependencies. A simple algorithm for inferring the set of approximate functional dependencies from a subset of a full tuple set (i.e. a set of all tuples in the relation) is presented. It is shown that the upper bound of the sample complexity, which is the number of example tuples required to obtain a set of functional L:"-, dependencies whose error is at most c with a probability of at least 1 -6, is O(~/~V~), where n denotes the size of the full tuple set and the uniform distribution of examples is assumed. An experimental result, which confirms the theoretical analysis, is also presented.
منابع مشابه
Approximate Functional Dependencies for XML Data
Functional dependencies (FDs) are an integral part of database theory since they are used in integrity enforcement and in database design. Recently, functional dependencies satisfied by XML data (XFDs) have been introduced. In this work approximate functional dependencies that are XFDs approximately satisfied by a considerable part of the XML database are defined and the problem of inferring su...
متن کاملModeling Inhomogeneous High-dimensional Data-sets: with Applications in Learning Large-scale Gene Correlations
In functional genomics studies, observations often demonstrate substantial correlations that result from clustered or longitudinal aspects of the data being sampled. We show that a simple estimator for the correlation structure among samples can be used to eliminate local dependencies, when the assumption of mutual independence of observational units does not hold. We provide a simulation study...
متن کاملInferring dependencies from relations: a conceptual clustering approach
In this paper we consider two related types of data dependencies that can hold in a relation: conjunctive implication rules between attribute-value pairs, and functional dependencies. We present a conceptual clustering approach that can be used, with some small modifications, for inferring a cover for both types of dependencies. The approach consists of two steps. First, a particular clustered ...
متن کاملRecovery of Functional Dependencies from Data-Intensive Business Programs
Data integrity constraints usually form a major component in a data-intensive business system. To successfully reengineer a data-intensive business system, its data integrity constraints must be understood. Functional dependencies constitute an important type of data integrity constraints. Due to the nature of system development, a significant amount of functional dependencies is usually omitte...
متن کاملA genetic algorithm to discover relaxed functional dependencies from data
Approximate functional dependencies are used in many emerging application domains, such as the identification of data inconsistencies or patterns of semantically related data, query rewriting, and so forth. They can approximate the canonical definition of functional dependency (fd) by relaxing on the data comparison (i.e., by considering data similarity rather than equality), on the extent (i.e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002